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Abstract. Lagrangian relaxation has been used extensively in the design of approximation 
algorithms. This paper studies its strengths and limitations when applied to Partial Cover. 

We show that for Partial Cover in general no algorithm that uses Lagrangian relaxation 
and a Lagrangian Multiplier Preserving (LMP) a-approximation as a black box can yield an 
approximation factor better than |q. This matches the upper bound given by Konemann 
et al. (ESA 2006, pages 468-479). 

Faced with this limitation we study a specific, yet broad class of covering problems: Par- 
tial Totally Balanced Cover. By carefully analyzing the inner workings of the LMP algorithm 
we are able to give an almost tight characterization of the integrality gap of the standard 
linear relaxation of the problem. As a consequence we obtain improved approximations for 
the Partial version of Multicut and Path Hitting on Trees, Rectangle Stabbing, and Set 
Cover with p-Blocks. 



1. Introduction 

Lagrangian relaxation has been used extensively in the design of approximation algo- 



sign or appro 

rithms for a variety of problems such as TSP |2Cj, |2l|, /c-MST [lij, 0, H, US], partial vertex 
cover Q, /c-median [M, @, H] , MST with degree constraints Q and budgeted MST 

In this paper we study the strengths and limitations of Lagrangian relaxation applied 
to the Partial Cover problem. Let S be collection of subsets of a universal set U with cost 
c : S — » R + and profit p : U — » R+, and let P be a target coverage parameter. A set C C S 
is a partial cover if the overall profit of elements covered by C is at least P. The objective is 
to find a minimum cost partial cover. 

The high level idea behind Lagrangian relaxation is as follows. In an IP formulation 
for Partial Cover, the constraint enforcing that at least P profit is covered is relaxed: The 
constraint is multiplied by a parameter A and lifted to the objective function. This relaxed 
IP corresponds, up to a constant factor, to the prize-collecting version of the underlying 
covering problem in which there is no requirement on how much profit to cover but a penalty 
of \p(i) must be paid if we leave element i £ U uncovered. An approximation algorithm 
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for the prize-collecting version having the Lagrangian Multiplier Preserving (LMP) propert^ 
is used to obtain values Ai and A2 that are close together for which the algorithm produces 
solutions C\ and C2 respectively. These solutions are such that C\ is inexpensive but unfeasible 
(covering less than P profit), and C2 is feasible (covering at least P profit) but potentially 
very expensive. Finally, these two solutions are combined to obtain a cover that is both 
inexpensive and feasible. 

Broadly speaking there are two ways to combine C\ and €2- One option is to treat 
the approximation algorithm for the prize-collecting version as a black box, only making 
use of the LMP property in the analysis. Another option is to focus on a particular LMP 
algorithm and exploit additional structure that it may offer. Not surprisingly, the latter 
approach has yielded better approximation guarantees. For example, for /c-median compare 
the 6-approximation of Jain and Vazirani [26] to the 4-approximation of Charikar and Guha 



; for fe-MST compare the 5-factor to the 3-factor approximation due to Garg [131 ]. 
The results in this paper support the common belief regarding the inherent weakness of 
the black-box approach. First, we show a lower bound on the approximation factor achiev- 
able for Partial Cover in general using Lagrangian relaxation and the black-box approach 
that matches the recent upper bound of Konemann et al. [Hl|]. To overcome this obstacle, 
we concentrate on Kolen's algorithm for Prize-Collecting Totally Balanced Cover [30[]. By 
carefully analyzing the algorithm's inner workings we identify structural similarities between 
C\ and C2, which we later exploit when combining the two solutions. As a result we derive 
an almost tight characterization of the integrality gap of the standard linear relaxation for 
Partial Totally Balanced Cover. This in turn implies improved approximation algorithms for 
a number of related problems. 



1.1. Related Work 

Much work has been done on covering problems because of both their simple and elegant 
formulation, and their pervasiveness in different application areas. In its most general form 
the problem, also known as Set Cover, cannot be approximated within (1 — e)ln|(7| unless 
NP C DTIME ( I [/ 1 log log 1 17 1 ) 0. Due to this hardness, easier, special cases have been studied. 

A general class of covering problems that can be solved efficiently are those whose 
element-set incidence matrix is balanced. A 0, 1 matrix is balanced if it does not contain 
a square submatrix of odd order with row and column sums equal to 2. These matrices were 
introduced by Berge [5j who showed that if A is balanced then the polyhedron {x > : Ax > 1} 
is integral. A 0, 1 matrix is totally balanced if it does not contain a square submatrix with row 
and column sums equal to 2 and no identical columns. Kolen [30J gave a simple primal-dual 
algorithm that solves optimally the covering problem defined by a totally balanced matrix. 
A 0, ±1 matrix is totally unimodular if every square submatrix has determinant or ±1. Al- 
though totally balanced and totally unimodular matrices are subclasses of balanced matrices, 
the two classes are neither disjoint nor one is included in the other. 

Beyond this point, even minor generalizations can make the covering problem hard. For 
example, consider the vertex cover problem: Given a graph G = (V, E) we are to choose 
a minimum size subset of vertices such that every edge is incident on at least one of the 
chosen vertices. If G is bipartite, the element-set incidence matrix for the problem is totally 
unimodular; however, if G is a general graph the problem becomes NP-hard [29|]. Numerous 



The definition of the LMP property is outlined in Section 2. 



LAGRANGIAN RELAXATION AND PARTIAL COVER 



3 



approximation algorithms have been developed for vertex cover [2j]. The best known approx- 
imation factor for general graphs is 2 — o(l) 0, 18, 28]; yet, after 25 years of study, the best 



constant factor approximation for vertex cover remains 2 [§, 0, |22( . This lack of progress has 
led researchers to seek generalizations of vertex cover that can still be approximated within 
twice of optimum. One such generalization is the multicut problem on trees: Given a tree 
T and a collection of pairs of vertices, a cover is formed by a set of edges whose removal 



separates all pairs. The problem was first studied by Garg et al. [14( who gave an elegant 
primal-dual 2-approximation. 

A notable shortcoming of the standard set cover formulation is that certain hard-to- 
cover elements, also known as outliers 01, can render the optimal solution very expensive. 
Motivated by the presence of outliers, the unit-profit partial version calls for a collection 
of sets covering not all but a specified number k of elements. Partial Multicut, a.k.a. k- 
Multicut, was recently studied independently by Levin and Segev (IH and by Golovin et al. 
17 ] , who gave a | + e approximation algorithm. This scheme was generalized by Konemann 



et al. 31] who showed how to design a la + e approximation for any covering problem using 



Lagrangian relaxation and an a-LMP approximation as a black box. (Their algorithm runs 
in time polynomial on \U\, \S\c and the running time of the a-LMP approximation.) 

1.2. Our Results and Outline of the Paper 

Section [3] shows that for Partial Cover in general no algorithm that uses Lagrangian 
relaxation and an a-LMP approximation as a black box can yield an approximation factor 
better than |a. In Section d] we give an almost tight characterization of the integrality gap 
of the standard LP for Partial Totally Balanced Cover, settling a question posed by Golovin 
et al. [17]. Our approach is based on Lagrangian relaxation and Kolen's algorithm. We 
prove that IP < (l + p=r) LP + kc max for any k > 1, where IP and LP are the costs of 
the optimal integral and fractional solutions respectively and c max is the cost of the most 
expensive set in the instance. The trade-off between additive and multiplicative error is 
not an artifact of our analysis or a shortcoming of our approach. On the contrary, this is 
precisely how the integrality gap behaves. More specifically, we show a family of instances 
where IP > (l + ^) LP 

+ nCmax' In other words, there is an unbounded additive gap in 
terms of c max but as it grows the multiplicative gap narrows exponentially fast. 

Finally, in Section [5] we show how the above result can be applied, borrowing ideas 
from [TtI ]. to get a p + e approximation or a quasi-polynomial time p- approximation 



for covering problems that can be expressed with a suitable combination of p totally-balanced 
matrices. This translates into improved approximations for a number of problems: a 2 + e 



approximation for the Partial Multicut on Trees [33J, [l7j] , a 4 + e approximation for Partial 
Path Hitting on Trees [1H], a 2-approximation for Partial Rectangle Stabbing [l5| . and a p 
approximation for Partial Set-Cover with p-blocks [3]. In addition, the e can be removed 
from the first two approximation guarantees if we allow quasi-polynomial time. It is worth 
noting that prior to this work, the best approximation ratio for all these problems could be 
achieved with the framework of Konemann et al. [31]. In each case our results improve the 
approximation ratio by a I multiplicative factor. 
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2. Lagrangian relaxation 

Let S = {1, . . . , m} be a collection of subsets of a universal set U = {1, . . . , n}. Each set 
has a cost specified by c £ -RIl, and each element has a profit specified by p £ -R™ . Given 
a target coverage P, the objective of the Partial Cover problem is to find a minimum cost 
solution C C 5 such that p{C) > P, where the notation p(C) denotes the overall profit of 
elements covered by C. The problem is captured by the IP below. Matrix A = {a>ij} € 
{0, l} nxm is an element-set incidence matrix, that is, = 1 if and only if element i € U 
belongs to set j € S; variable Xj indicates whether set j is chosen in the solution C; variable 
ri indicates whether element % is left uncovered. 

Lagrangian relaxation is used to get rid of the constraint bounding the profit of uncovered 
elements to be at most p(U) — P. The constraint is multiplied by the parameter A, called 
Lagrange Multiplier, and is lifted to the objective function. The resulting IP corresponds, up 
to the constant A (p(U) — P) factor in the objective function, to the prize-collecting version 
of the covering problem, where the penalty for leaving element i uncovered is Xpi. 

min c ■ x min c ■ x + Xp ■ r — A (p(U) — P) 

Ax + Ir > 1 Ax + Ir > 1 



p ■ r < p(U) - P Lagrangian r h Xj £ {0, 1} 

ri , Xj e{0,l} Relaxation 



Let OPT be the cost of an optimal partial cover and OPT-PC(A) be the cost of an 
optimal prize-collecting cover for a given A. Let A be an a-approximation for the prize- 
collecting variant of the problem. Algorithm A is said to have the Lagrangian Multiplier 
Preserving (LMP) property if it produces a solution C such that 

c(C) + a X(p{U) - p{C)) < a OPT-PC(A). (2.1) 

Note that OPT-PC(A) < OPT + A (p(U) - P). Thus, 

c(C) < a (OPT + A (p{C)-P)\ (2.2) 

Therefore, if we could find a value of A such that C covers exactly P profit then C is 
a-approximate. However, if p(C) < P, the solution is not feasible, and if p{C) > P, equation 
(|2.2p does not offer any guarantee on the cost of C. Unfortunately, there are cases where 
no value of A produces a solution covering exactly P profit. Thus, the idea is to use binary 
search to find two values Ai and A2 that are close together and are such that A(X\) covers 
less, and A(X2) covers more than P profit. The two solutions are then combined in some 
fashion to produce a feasible cover. 



3. Limitations of the black-box approach 

A common way to combine the two solutions returned by the a-LMP is to treat the 
algorithm as a black box, solely relaying on the LMP property (12. ip in the analysis. More 
formally, an algorithm for Partial Cover that uses Lagrangian relaxation and an a-LMP 
approximation A as a black box is as follows. First, we are allowed to run A with as many 
different values of A as desired; then, the solutions thus found are combined to produce a 
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feasible partial cover. No computational restriction is placed on the second step, except that 
only sets returned by A may be used. 

Theorem 3.1. In general, the Partial Cover problem cannot be approximated better than |a 
using Lagrangian relaxation and an a-LMP algorithm A as a black box. 

Let Ai,...A q and B\, . . . B q be sets as depicted on the 
right. For each i and j the intersection Ai n Bj consists of a 
cluster of q elements. There are q 2 clusters. Set A4 is made 
up of q clusters; set Bi is made up of q clusters and two addi- 
tional elements (the leftmost and rightmost elements in the 
picture.) Thus \Ai\ = q 2 and \Bi\ = q 2 + 2. In addition, there 
are sets 0%, . . . , O q , which are not shown in the picture. Set 
Oi contains one element from each cluster and the leftmost 
element of Bi. Thus |0,| = q 2 + 1. The cost of Oj is |, the 

cost of Ai is ^ , and the cost of Bi is . Every element has 
unit profit and the target coverage is P = q 3 + q. It is not 

hard to see that Oi, . . . , O q is an optimal partial cover with a cost of 1. Furthermore, for any 
value of A the optimal prize-collecting cover uses sets of one kind. 

Lemma 3.2. For all values of X, the optimal prize- collecting cover in our instance is either 
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the empty cover or A\ 



,A q or Bi, 



B q or Oi,...,O q 



Proof. Suppose that some subsets of the B-sets and the O-sets have already been chosen. 
Every ^4-set has the same marginal benefit (the penalty of elements not-yet-covered by the 
set minus its cost) the independent of the other A-sets. Thus, in an optimal solution either 
all the ^4-sets are chosen or none is. 

Now suppose that the O-sets and the ^4-sets have already been chosen. In this case there 
are two marginal benefits for the -B-sets, depending on whether an O-set already covers the 
leftmost element of the £>-set or not. Thus, an optimal strategy for the -B-sets is either to 
choose none, all, or the complement of the O-sets, i.e., Bi is chosen if and only if Oi is not 
chosen. A solution where -B-sets and O-sets complement each other is always worse than 
either choosing all the O-sets and no -B-sets, or vice versa. Thus, in an optimal solution 
either all the -B-sets are chosen or none is. It follows that the same holds for the O-sets; i.e., 
they are either all in or all out. 

Notice that if the -B-sets are chosen then there is no reason to choose any of the remaining 
sets. If the -B-sets are not chosen and the O-sets are chosen then there is no reason to choose 
the A-sets. The only possibility left is to choose only the A-sets, or the empty cover. ■ 

The a-LMP approximation algorithm we use has the unfortunate property that it never 
returns sets from the optimal solution. 

Lemma 3.3. There exists an a-LMP approximation A that for the above instance and any 
value of X outputs either or A\, . . . , A q or B±, . . . , B q . 

Proof. Let us denote each of the four alternatives in Lemma 13.21 by E, A, B and O. The 
prize-collecting cost of these covers is (q 3 + 2q)X, |a + 2gA, |a and 1 + qX respectively. For a 
fixed value of A, the minimum of these four quantities corresponds to the cost of the optimal 
prize-collecting cover. The line below shows which solution is optimal as a function of A. 
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E A O B 
1 1 1 

2a 3-2a 4a-3 

IF 3q 3q 



If a is big enough then the j4-interval disappears. Namely, if ^% > 3 then the line 
looks as follows 



E O B 
1 1 

1 4a-3 



q 3 +q 3q 

We are now ready to describe the a-LMP algorithm. Given a value of A, we need to 
decide whether to output the empty cover, the ^4-sets or the -B-sets. If A falls in the interval 
corresponding to one of these three solutions then output that solution; since the cover output 
is optimal, the LMP property (|2.ip follows trivially. 

If A falls in the O-interval and A < §- then output the A-sets; the LMP property holds 
since 

c(A) + a\p{A) = ^a + a\2q<a(l + \q) = a (c(0) + A (p(£7) -p(O))). 

If A falls in the O-interval and A > j- then output the -B-sets; the LMP property holds 
since 

c(B) + aXpjB) = < a(l + X q) = a(c(0) + A (p(U) - p(O))) . 



g 



Hence, if we use A as a black box we must build a partial cover with the sets A\, . . . , A, 
and B\, . . . , B q . Note that in order to cover q 2 +q elements either all A-sets, or all B-sets must 
be used. In the first case | additional -B-sets are needed to attain feasibility, and the solution 
has cost |a; in the second case the solution is feasible but again has cost |a. Theorem 13.11 
follows. 

One assumption usually made in the literature 0, 11, 31] is that c max = maxj Cj < e OPT, 



for some constant e > 0, or more generally an additive error in terms of c max is allowed. This 
does not help in our construction as c max can be made arbitrarily small by increasing q. 

Admittedly, our lower bound example belongs to a specific class of covering problem 
(every element belongs to at most three sets) and although the example can be embedded 
into a partial totally unimodular covering problem (see Appendix [A} , it is not clear how to 
embed the example into other classes. Nevertheless, the \a upper bound of Koneman et el. 



311 ] makes no assumption about the underlying problem, only using the LMP property (|2.1|) 
in the analysis. It was entirely conceivable that the |a factor could be improved using a 
different merging strategy — Theorem 13.11 precludes this possibility. 
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4. Partial Totally Balanced Cover 

In order to overcome the lower bound of Theorem 13 .1\ one must concentrate on a specific 
class of covering problems or make additional assumptions about the ct-LMP algorithm. In 
this section we focus on covering problems whose IP matrix A is totally balanced. More 
specifically, we study the integrality gap of the standard linear relaxation for Partial Totally 
Balanced Cover (P-TBC) shown below. 

Theorem 4.1. Let IP and LP be the cost of the optimal integral and fractional solutions of 
an instance of P-TBC. Then IP < (l + 3^t) LP + k c max for any k € Z+. Furthermore, for 
any large enough k £ Z+ the exists an instance where IP > (l + %k-i 

) LP + | 

min c • x max 1 • y — (p(U) — P) A 

Ax + Ir > 1 A T y < c 



p. r <p(U)-P LP Duality ^ X P 

ri,x e >0 Vi,\>0 

The rest of this section is devoted to proving Theorem 14.11 Our approach is based on 
Lagrangian relaxation and Kolen's algorithm for Prize-Collecting Totally Balanced Cover 
(PC-TBC). The latter exploits the fact that a totally balanced matrix can be put into greedy 
standard form by permuting the order of its rows and columns; in fact, the converse is also 
true [1^]. A matrix is in standard greedy form if it does not contain as an induced submatrix 



1 1 
1 



(4.1) 



There are polynomial time algorithms that can transform a totally balanced matrix into 
greedy standard form [37( by shuffling the rows and columns of A. Since this transformation 
does not affect the underlying covering problem, we assume that A is given in standard greedy 
form. 

4.1. Kolen's algorithm for Prize-Collecting Totally Balanced Cover 

For the sake of completeness we describe Kolen's primal-dual algorithm for PC-TBC. 
The algorithm finds a dual solution y and a primal solution C, which is then pruned in a 
reverse-delete step to obtain the final solution C. The linear and dual relaxations for PC-TBC 
appear below. 

min c • x + Xp ■ r max 1 • y 

Ax + Ir >1 A T y < c 



r *'^° LP Duality ^ X P 

The residual cost of the set j w.r.t. y is defined as c'j = Cj — = i ])%■ The algorithm 
starts from the trivial dual solution y = 0, and processes the elements in increasing column 
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Kolen(A, c,p, A) 






1 // Dual update 


7 


// Reverse delete 


2 y^O, C<-0, C*^0 


8 


while C + do 


3 for i <— 1 to n do 


9 


j <— largest set index in C 


4 <5 <— min{c^ | = 1} 


10 


C^C + j 


5 j/i <— min{Ap i , 5} 


11 


C <— C \ { j' j dominates j' or j = j 1 } 


6 C^{j|c;=0} 


12 


return (C, y) 



order of A T . Let i the index of the current element. Its corresponding dual variable, yi, 
is increased until either the residual cost of some set j containing i equals (we say set j 
becomes tight), or yi equals Xpi (Lines 3-5). 

Let C = {j | c'j = 0} be the set of tight sets after the dual update is completed. As it 
stands the cover C may be too expensive to be accounted for using the lower bound provided 
by 1 ■ y because a single element may belong to multiple sets in C. The key insight is that 
some of the sets in C are redundant and can be pruned. 

Definition 4.2. Given sets ji,j2 we say that j% dominates ji in y if j\ > ji and there exists 
an item i such that yt > and i belongs to j% and j2, that is, = ay 2 = 1. 

The reverse-delete step iteratively identifies the largest index j in C, adds j to C, and 
removes j and all the sets it dominates. This is repeated until no set is left in C (Lines 8-11). 

Notice that all sets j € C are tight, thus we can pay for set j by charging the dual 
variables of items that belong to j. Because of the reverse-delete step if yi > then i belongs 
to at most one set in C; thus in paying for C we charge covered items at most once. Using 
the fact A is in standard greedy form, it can be shown (3(J that if i was left uncovered then 
we can afford its penalty, i.e., yi = Xpi. The solution C is optimal for PC-TBC since 

j,zc iG J7 s - t - * G i/ s - t - i€ H s - t - * et/ 

$je6:a,ij=l 3jeC:aij=l $jeC:aij=l 

If we could find a value of A such that Kolen(A, c,p, A) returns a solution (C, y) covering 
exactly P profit, we are done since from (14. 2|) it follows that 

I> = J>-A(p(E0-P). ( 4 -3) 

jec i& 

Notice that (y, A) is a feasible for the dual relaxation of P-TBC and its cost is precisely the 
right hand side of (|4.3p . Therefore for this instance IP=DL=LP and Theorem 14.11 follows . 

Unfortunately, there are cases where no such value of A exists. Nonetheless, we can 
always find a threshold value A such that for any innnitesimally small 5 > 0, A - = A — 5 and 
A + = A + 5 produce solutions covering less and more than P profit respectively. A threshold 
value can be found using Megiddo's parametric search [34] by making O(nlogm) calls to the 
procedure Kolen. For completeness the technique is sketched in Appendix [Bj 

Let y (y~) be the dual solution and C (C~) the set of tight sets when Kolen is run on 
A (A - ). Without loss of generality assume C covers more than P profit. (The case where C 
covers less than P profit is symmetrical: we work with y + and C + instead of y~ and C~.) 
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Our plan to prove Theorem 14.11 is to devise an algorithm to merge C and C in order to 
obtain a cheap solution covering at least P profit. 



4.2. Merging two solutions 

Before describing the algorithm we need to establish some important properties regarding 
these two solutions and their corresponding dual solutions. 

For any i, the value of y^ is a linear function of 5 for all i. This follows from the fact 
that 5 is infinitesimally small. Furthermore, the constant term in this linear function is yt. 

Lemma 4.3. For each i G U there exists a G Z, independent of 5, such that y~ = yi + a5. 

Proof. By induction on the number of iteration of the dual update step of kolen, using the 
fact that the same property holds for the residual cost of the sets. ■ 

A useful corollary of Lemma 14.31 is that C~ C C, since if the residual cost of a set is 
non-zero in y it must necessarily be non-zero in y~ . The other way around may not hold. 

At the heart of our approach is the notion of a merger graph G = (V,E). The vertex 
set of G is made up of sets from the two solutions, i.e., V = C © C~. The edges of G are 
directed and given by 

ji G C~ \ C, j2 G C \ C~ s.t. ji dominates j'2 in y~ , or 
jl G C \ C~, j2 G C~ \ C s.t. ji dominates j'2 in y 
This graph has a lot of structure that can be exploited when merging the solutions. 



e = { Cn,j 2 ) 



(4.4) 





k 


«2 


«2 


h 


j 


1 


1 


1 


1 


h 


1 


m 




1 


h 




1 


1 


S 



Lemma 4.4. The merger graph G = (V,E) of C and C is a forest of out-branchings. 

Proof. First note that G is acyclic, since if (ji, j'2) £ E then necessarily j\ > ]2- Thus, it is 
enough to show that the in-degree of every j G V is at most one. Suppose otherwise, that 
is, there exist G V such that (ji , j) , (j2i j) € E. Assume that ji < j'2 and j G C (the 

remaining cases are symmetrical). 

By definition (|4.4p . we know that ji (j'2) G C~ and that 
there exists i\ (z 2 ) that belongs to j and ji (j'2) such that > 
(y^ 2 > 0) . Since A T is in standard greedy form we can infer that 
z 2 belongs to j\ if i\ < i2, or i\ belongs to J2 if i\ > i2'- The 
diagram on the right shows how, using the fact that A T does 
not contain (|4,ip as an induced submatrix, we can infer that the 

boxed entries must be 1. In either case we get that j'2 dominates j\ in y~ , which contradicts 
the fact that both belong to C~. ■ 

The procedure merge starts from the unfeasible solution D = C~ and guided by the 
merger graph G, it modifies D step by step until feasibility is attained. The operation used 
to update D is to take the symmetric difference of D and a subtree of G rooted at a vertex 
r G V, which we denote by T r . For each root r of an out-branchings of G we set D <— D(BT r , 
until p(D © T r ) > P. At this point we return the solution produced by lNCREASE(r, D). 

Notice that after setting D <— D T r in Line 5, the solution D "looks like" C within 
T r . Indeed, if all roots are processed then D = C. Therefore, at some point we are bound 
to have p(D © T r ) > P and to make the call iNCREASE(r, D) in Line 6. Before describing 
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MERGE((7 _ , C) 

1 let G be the merger graph for C~ and C 

2 D <- C" 

3 for each root r in G do 

4 ifp(DeT r )<P 

5 then D <— D ®T r 

6 else return iNCREASE(r, D) 



increase we need to define a few terms. Let the absolute benefit of set j, which we denote 
by bj, be the profit of elements uniquely covered by set j, that is, 

bj = p ({ i G U | V / G C U C~ : a ir = 1 iff j' = j }) ■ (4.5) 

Let DCCU C _ . Note that if j G D, the removal of j decreases the profit covered by D by 
at least by, on the other hand, if j ^ D, its addition increases the profit covered by at least 
bj. This notion of benefit can be extended to subtrees, 

A(T j ,D)= b f~ E b r- ( 4 - 6 ) 

j'eTj\D j'eTjHD 

We call this quantity the relative benefit of Tj with respect to D. It shows how the profit of 
uniquely covered elements changes when we take D®Tj. Note that A(Tj,D) can positive or 
negative. 

Everything is in place to explain INCREASE^, D). The algorithm assumes the input 
solution is unfeasible but can be made feasible by adding some sets in Ty more precisely, we 
assume p(D) < P and P < p(D) + A(Tj,D). If adding j to D makes the solution feasible 
then return D + j (Lines 2-3) . If there exists a child c of j that can be used to propagate the 
call down the tree then do that (Lines 4-5). Otherwise, split the subtree Ty Add j to D and 
process the children of c, setting D <— D © T c until D becomes feasible (Lines 6-9). At this 
point p(D) > P and p{D@T c ) < P. UP-p{D®T c ) < p(D) -P then call increase(c, D@T c ) 
else call decrease(c, D) and let D' be the cover returned by the recursive call (Lines 10-12). 
Finally, return the cover with minimum cost between D and D' . 

The twin procedure decrease^', D) is essentially symmetrical: Initially the input is 
feasible but can be made unfeasible by removing some sets in Tj] more precisely p(D) > P 
and P <p(D) + A(T C ,D). 

At a very high level, the intuition behind the increase/decrease scheme is as follows. 
In each call one of three things must occur: 

(i) A feasible cover with a small coverage excess is found (Lines 2-3), or 

(ii) The call is propagated down the tree at no cost (Lines 4-5), or 

(iii) A subtree Tj is split (Lines 6-9). In this case, the cost Cj cannot be accounted 
for, but the offset in coverage \P — p(D)\ is reduced at least by a factor of 3. 

If the increase/decrease algorithms split many subtrees (incurring a high extra cost) then 
the offset in coverage must have been very high at the beginning, which means the cost of 
the dual solution is high and so the splitting cost can be charged to it. In order to flesh out 
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f 

INCREASE(j, D) 


DECREASE(j, D) 


1 


// assume p{D) <P< p{D) + A(T 3 -, D) 


1 


// assume p{D) >P> p(D) + A(Tj,D) 


2 


if p(D+j) > P 


2 


if p((D®T j )+j) >P 


3 


then return D + j 


3 


then return (D ®Tj) + j 


4 


if 3 child c of j : p(D) + A(T C , D) > P 


4 


if 3 child c of j : p{D) + A(T C , D) < P 


5 


then return increase(c, D) 


5 


then return DECREASe(c, D) 


6 


D^D+j 


6 


D^D+j 


7 


while p(Z>) < P do 


7 


while > P do 


8 


c <— child of j maximizing A(T C , D) 


8 


c child of j minimizing A(T C , D) 


9 


D^- D®T C 


9 


L> <- L> © T c 


10 


if P p(L> © T c ) < - P 


10 


if p{D®T c )-P <P- p(D) 


11 


then £>' <— increase(c, D © T c ) 


11 


then D' <— increase(c, D) 


12 


else D' <— decrease(c, D) 


12 


else £>' <— decrease(c, D © T c ) 


13 


return min cost {£), £)'} 


13 


return min cost {D © T c , D'} 



these ideas into a formal proof we need to establish some crucial properties of the merger 
graph and the algorithms. 

Lemma 4.5. If yi < Xpi then there exist j' € C and j" E C~ such that either j' = j" or 
(j f , j") EE or (j",j')€E. 



12 n i 

l l l 

i m 

i a m 



Proof. Since yi < Xpi, by Lemma HT31 we get that < i\ ii i 

Xpi as well. Thus, there exists a set j £ C~ such that j 1 1 1 
aij = 1 that becomes tight right after processing i in the ji 1 CD CD 
dual update of kolen. Due to the reverse-delete step j 2 1 |T| 

either j £ C~ or there exists j\ G C~ that dominates j in 

y~ trough some element i\ E U, i.e., y^ > and a^j = ai 1 j 1 = 1. In the latter case, since the 
set j is tight after i is processed, it follows that i\ < i. Because A is in standard greedy form 
we infer that = 1. By Lemma 14.31 we have C~ C C, thus j E C. A similar reasoning 
as above shows that either j G C or there exists j2 £ C that dominates j in y trough some 
element %2 G f7 such that aij 2 = 1 and 12 <i- 

If there exists a set in C~ CI C covering i the lemma follows, so suppose otherwise. If 
j G C~ {j G C) then there exists j% G C (ji G C _ ) that dominates j in y~ (y), and again the 
lemma holds. Finally, consider the case j G" C~ and j ^ C. Assume ji < j'2, the other case 
is symmetrical. Because A is in standard greedy form we get that ai 2 j 1 = 1 and (j2,ji) G E 
if i\ < i2, or aj U ' 2 = 1 and (ji, j'2) G E if ?2 < *i> I n either case the lemma follows. The 
diagram on the right shows a summary of the entries that were inferred using the fact that 
A does not contain (14.11) as an induced submatrix. ■ 



Lemma 4.6. Let (j,D) be the input of increase/decrease. Then at the beginning of each 
call we have j' G D or j" G D for all G E. Furthermore, if j' G D and j" G D then 

j' or j" must have been split in a previous call. 

Proof. We prove the lemma by induction on the number of calls to increase/decrease. 
Clearly, the lemma is true on the first call when \{j' ,j"} fl D\ = 1 for all G E and no 

vertex has been split yet. 
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Suppose the lemma holds at the beginning of a call to increase/decrease, we argue 
that it also holds at the beginning of the next call. If the next call happens in Line 5 then 
the lemma holds trivially since D did change. Otherwise, j is split; namely, j is added to D 
(Line 6) and D is updated by taking D © T c (Line 9) for a number of children c of j. Before 
splitting j, for every child c of j and every (j'j") £ T c we have \{j',j"} fl D\ = 1, and the 
same holds after j is split. Hence, the lemma follows. ■ 

Lemma 4.7. Let (j,D) be the input of increase/decrease. For increase we always have 
p(D) <P< p(D) + A(Tj,D), and for decrease we have p(D) >P> p(D) + A (7), D). 

Proof. By induction on the number of calls to increase/decrease. At the base case the 
call is made by merge and thus we have p{D) < P < p(D © Tj). We claim that 

p(D®T j )-p(D) = A(T j ,D), (4.7) 

from which the lemma follows. We argue that every i £ U contributes the same amount to 
each side of (14.71) . From now on we assume that i is not covered by D \ Tj, otherwise its 
contribution to (14. 7p is zero. 

Suppose i is uniquely covered by some set j E Tj. Then its contribution to both sides of 
(|4.7p is either pi or —pi depending on whether i £ D ®Tj or i £ D. 

Now consider the case when i is covered by multiple sets in Tj. Recall that \{j' , j"}nD\ = 
1 for all (j' , j") G Tj because DdTj = C~ fl Tj. If yi < Xpi then by Lemma [131 is covered by 
both D and D ®Tj. If = Xpi then i covered by at most one set in C~ and at most one set 
in C. It follows that i must be covered exactly by one set in D and another set in D © Tj. 
Hence, the contribution of i to both sides of (|4.7|) is zero. 

For the inductive step suppose the lemma holds at the beginning of this call. Clearly, 
if the next call is made in Line 3 the lemma holds. Suppose that the call is made in Lines 
11-12. Note that after adding j to D (Line 7) we have for every child c of j 

p(D(BT c )-p(D) = A(T c ,D). 

Therefore, by inductive hypothesis, we are bound to exit the while loop and the lemma holds 
in the next call. ■ 

Recall that y is also a feasible solution for the dual relaxation of P-TBC and its cost 
is given by DL = Ya=i ~ (p(P) ~ P)^- The following lemma proves the upper bound of 
Theorem 14.11 

Lemma 4.8. Suppose merge outputs D. Then c(D) < (l + ^Lr DL + fc 

c max for all k £ Z+. 

Proof. Let us digress for a moment for the sake of exposition. Suppose that in Line 6 of 
merge, instead of calling increase, we return D' = D © T r . Notice every arc in the merger 
graph has exactly one endpoint in D' . By Lemma 14.51 anv element i not covered by D' must 
have yi = Xpi. Furthermore, if j/j > then there exists at most one set in D' that covers i; if 
two such sets exist, one must dominate the other in y and y~ , which is not possible. Hence, 

< D )=Y. E^'= E V* =E yi-(p(U)-p(D'))\<DL + (p(D>)-P)\ (4.8) 

7&D' i:oi,'=l i s.t. i£U 

3jeD':a zj =l 

In the fortunate case that (p(D') — P)X < /cc max , the lemma would follow. Of course, this 
need not happen and this is why we make the call to increase instead of returning D' . 

Let j q be the root of the q th subtree split by increase/decrease. Also let D q the 
solution right before splitting Tj q , and D' q and D'' be the unfeasible/feasible pair of solutions 
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after the splitting, which are used as parameters in the recursive calls (Lines 11-12). Suppose 
Lines 7-9 processed only one child of j„, this can only happen in increase, in which case 
p(Dg) > P but p(D'') — bj q < P. The same argument used to derive (|4.8p gives us 

c K \ {3< q }) < E Vi ~ ~ P( D 'D + A ^ DL ( 4 ' 9 ) 

ieu 

The cost of the missing sets is c({j< q }) < qc m3iX , thus if q < k the lemma follows. A similar 
bound can be derived if the recursive call ends in Line 3 before splitting the k th subtree. 
Finally, the last case to consider is when Lines 7-9 process two or more children j q for all 
q < k. In this case 

\p(D q )-P\>3mm{\p(D' q )-P\,\p(D^)-P\}=3\p(D q+1 )-P\, (4.10) 

which implies \p{D x ) -P\> 3 k ~ 1 \p{D h ) - P\> 3 k - 1 \p{D'D - P\. Also, X(P - p(D 1 )) < DL 
since all elements i not covered by D\ must be such that yi = Xpi. Hence, as before 

c {D'i \ {j< k }) < DL + A [p{D'D - P) < DL + A - ' "f™ < (l + DL (4.11) 

Adding the cost of {j<k} we get the lemma. ■ 



4.3. Integrality gap example 

To finish the proof of Theorem 14. II we now show a family of instances of P-TBC exhibiting 
an integrality gap of IP > (l + p=r) LP + |c max for large enough k 

Let T be a rooted tree and {(sj, £i)}f =1 be a collection of pairs of nodes of T, each defining 
a unique path in T. Let A be the incidence matrix of paths to edges of T. The covering 
problem defined by A is the well- know Multicut problem where the objective is to find a 
minimum cost set of edges whose removal separates all pairs. If for every (sj, ti) pair Sj is the 
ancestor of U or vice-versa then A is totally balanced. 

Our tree T is made up of a complete binary tree with height 2q plus a 2-path coming out 
of the apex of the binary tree going up into the real root of T; thus the tree has 2 2<?+1 + 1 
nodes. The reader is referred Figure [1] for a picture of the instance. The cost of every edge is 
3. There are two kinds of paths: internal and fringe paths. For every node in the binary tree 
there is an internal path of length two coming out of the node going up; there are 2 2q+1 — 1 
such paths each having a profit of 4 9 . For each leaf and the root there is a fringe path of 
length 1 incident on it; there are 2 2q + 1 such paths each having a profit of 2. The target 
coverage is given by P = 2 (4 9_1 + . . . + 4° + l) = 2 q+ 3 +4 , where the shorthand notation X 
stands for p(U) — X. 

Consider the dual solution y where every internal path gets a dual value of 1 and every 
fringe path gets a dual value of 2. The solution is feasible for A = 1 and has cost 

DL = ^>-PA= 1049-1 (4.12) 

ieu 

To show that y is optimal, we construct a primal (fractional) solution x with the same 
cost, which is a convex combination of two integral solutions x\ and %i- Let x\ consist of 
edges in every other level of T starting at the leaf level and let 5?2 be the complement of x\. 
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(a) The tree T for q = 2 



(b) Solution x\ 





(c) Solution X2 



(d) Solution x 



Figure 1: Integrality gap example for q 
sponding solution 



2. In (b-d) the wiggly edges belong to the corre- 



(See Figure [TJ Note that p{x\ 
P and let x 



ap(xi) + j3p(x 2 ) 



c(x) 



= 2 and p(x 2 ) = 2 2q+1 . Consider the convex combination 
ax\ + (3x2- Its cost is given by 

ac(xi) + (3c(x2) 



a 



= DL. 

Clearly, x is a feasible fractional solution. Therefore, it is optimal. 

Let x be the solution defined as follows. For edges incident on a leaf and leave out the 
y — 1 rightmost ones and choose remaining ones. For other edges, choose the edge only if 
one of the edges immediately below is not chosen. (See Figure [Q. If we try to pay for x using 
the dual cost we will charge twice 2q — 1 internal paths whose both edges are chosen in x. In 
other words, 

c{x) = DL + 2q—l (4.13) 
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Due to their high profit, internal paths cannot be left uncovered. Using this fact we can 
infer that x is indeed an optimal integral solution covering P profit. Choosing k = q log 3 4 
we get the lower bound of Theorem 14.11 That is, for large enough q, 

c(x) > (l + c(x) + c max ^. (4.14) 

For smaller values of k the slightly weaker bound with c max additive error holds. It 
is worth noting that the example can be adapted to yield the same bound for instances with 
unit profits. 



5. Applications 

In this section we show how Theorem 14.11 implies better approximation algorithms for a 
number of covering problems that can be expressed with a suitable combination of p totally- 
balanced matrices. 

Definition 5.1. Matrix B is said to be row-induced by a collection of matrices A\, . . . € 
R nxrn if for all i, the ith row of B equals the zth row of Aj for some 1 < j < k. 

Definition 5.2. Matrix A € {0, l} nxm is said to be p-separable if there exist matrices 
Ai,...A p € {0, l} xxm such that A = £V A q and every matrix row-induced by A\, . . . , A p is 
totally balanced. 

Our algorithms make us of following lemma to absorb the additive error in our bounds. 

Lemma 5.3. Let A be an algorithm for a given partial covering problem (U,S,P) that pro- 
duces a solution with cost at most a OPT + kc max , where OPT is the cost of the optimal 

k 

solution. Then there exists an a -approximation that makes 1 calls to A. 

Proof. The idea is to run A on a modified instance (U',S',P'). Let X be the — most 
expensive sets in the optimal cover for (U,S,P). Let S' = S\ {j\cj> minj/ g jf Cji}, U' = 
U \{i \ covered by X}, and P' = P — p(X). The optimal solution in the new instance has 
cost OPT' = OPT — c(X). 

Adding X to the solution returned by A(U',S',P') gives us a feasible solution, for the 
original instance, with cost at most 

a OPT' + k 4 ax + c(X) < a OPT' + k ^^c(X) + c(X) = a OPT. 

k 

Unfortunately we do not know which sets comprise X. Therefore A is run on every choice 
of X and the best cover found is returned. The number of calls to A needed is \U\ choose 

a-l ■ ■ 

We are ready to describe our approximation algorithms for covering problems that can 
be described with a p-separable matrix. We assume the decomposition is given to us. For an 
arbitrary matrix finding such a decomposition, or even testing for its existence, may be hard. 
However, for our application problems it is easy to find the p matrices using the problem 
definition. 

Theorem 5.4. Let A be p-separable into matrices A±, . . . ,A p where p > 1. For any constant 
e > there is a (p + e)- approximation and a quasi-polynomial time p- approximation for the 
partial covering problem defined by A. 
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Proof. Our algorithm is based on the approach of jl7l.ll9l|. First, we find an optimal fractional 
solution (x,r) for the partial covering problem defined by A. Let af be the ith row of A q . 
Notice that for each i there must exist a qi such that af-x> - = H. Second, we construct a 
matrix B by choosing af as the ith row of B. Note that B is totally balanced. Finally, we 
find a threshold value A* for B and invoke merge to find a cover D. 

Any feasible solution for B is also feasible for A, thus D is a feasible cover for A. Note 
that (px,r) is a feasible fractional solution for B. Let OPT be the cost optimal solution for 
A. By Lemma 14.81 and letting k > log 3 ^ + 1 we get, 

c{D) < (l + — c(px)+/cc max = f 1 + t^zt) pc(x)+A;c max < (p+e)OPT+A; c max (5.1) 

This solution is p + e approximate with an additive error of k c max that can be absorbed using 
Lemma 15.31 

For the quasi-polynomial time ^-approximation, setting k > log/) \U\ + 1 we get 
c(D) < ( 1 + c(px) + k c max < p c{x) + (k + 1) c max < p OPT + (k + 1) c max . (5.2) 

And the theorem follows. ■ 

This implies improved approximation algorithm for the partial version of Multicut and 
Path Hitting on Trees, and Rectangle Stabbing. To show this, we use the following fact about 
totally balanced matrices. Let T be a rooted tree. An s-t path in T is said to be descending 
if s is an ancestor of t. Let V and Q be collections of descending paths in T, and let A be the 
V-Q incidence matrix A = {aij}, where ai j = 1 if and only if the ith path in V intersects 
the jth path in Q. It is known that A is totally balanced: To put the matrix into Greedy 
Standard form arrange the columns and rows of A so that the paths in V and Q appear in 
non- increasing distance from the root. 

Corollary 5.5. There is a 2 + e approximation and a quasi-polynomial time 2- approximation 
for Partial Multicut on Trees. 

Proof. The input of Partial Multicut is a tree T and a collection of paths V in T, the 
problem is defined by the V-E[T] incidence matrix A. Even though a path in V may not be 
descending, we can always split such a path into two separate descending paths. Therefore, 
A is 2-separable. ■ 

Corollary 5.6. There is a 4 + e approximation and a quasi-polynomial time 4- approximation 
for Path Hitting on Trees. 

Proof. The input of Partial Path Hitting is a tree T and two collections of paths V and Q in T, 
the covering problem is defined by the V-Q incidence matrix A. Parekh and Segev [35[ noted 
that if we split each path in Q into two descending paths the cost of the optimal solution 
increases by at most a factor of 2. The matrix of this modified problem is 2-separable. ■ 

Our last application problem is Rectangle Stabbing. This is a special case of Set Cover 
with p-Blocks, a broad class of covering problems introduced by Hassin and Segev where 
the incidence matrix defining the problem is such that every row has p blocks of contiguous l's. 

Theorem 5.7. Let A be a matrix defining an instance of Set Cover with p-Blocks. Then 
there exists a polynomial time p- approximation algorithm for the partial covering problem 
defined by A. 
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Proof. We proceed as in Theorem 15.41 to reduce A to a matrix B. This new matrix is not 
only totally balanced, but each row consists of a single block of consecutive l's. For such 
matrices the merger graph used in Section 14.21 is in fact a path. In this case, at most one 
subtree is split in the execution of increase. Therefore we get the stronger guarantee that 
IP < LP + 

c max- Plugging in this into the proof of Theorem 15.41 gives the desired result. ■ 

Corollary 5.8. There is a 2- approximation for Partial Rectangle Stabbing and a d- approximation 
for Partial d-dimensional Rectangle Stabbing. 



6. Concluding remarks and open problems 

The results in this paper suggest that Lagrangian relaxation is a powerful technique for 
designing approximation algorithms for partial covering problems, even though the black-box 
approach may not be able to fully realize its potential. 

It would be interesting to extend this study on the strengths and limitation of Lagrangian 
relaxation to other problems. The obvious candidate is the fc-Median problem. Jain and 
Vazirani [2f| designed a 2a-approximation for fc-Median using as a black box an a-LMP 



approximation for Facility Location. Later, Jain et al. [27J gave a 2-LMP approximation 
for Facility Location. Is the algorithm in [26(] optimal in the sense of Theorem 13. Ik * Can 
the algorithm in [13] be turned into a 2-approximation for fc-Median by exploiting structural 
similarities when combining the two solutions? 

The class of totally unimodular matrices is undoubtedly the most important subclass of 
balanced matrices. An open problem is to establish good approximations for Partial Totally 
Unimodular Cover (P-TUC). The matrix used in Section [4.31 is also totally unimodular, so 
the lower bound on the integrality gap applies for P-TUC as well. Does the upper bound of 
Theorem O also hold P-TUC? 
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Appendix A. Partial Totally Unimodular Cover 

Theorem A.l. Partial totally unimodular cover cannot be approximated better than | using 
Lagrangian relaxation and a 1-LMP algorithm A as a black box. 

Proof. The instance is similar to that used in Theorem I3.lt The ^4-sets and the 5-sets are 
the same; for each i we define Oi as Bi minus the rightmost element. The cost of each A, B 
and O set is |, | and 1 respectively. The target coverage parameter is again P = q(q 2 + 1). 

It is straightforward to check that Lemmas 13.21 and 13.31 still holds for our new instance 
and a = 1. Then the same argument used in the proof of Theorem 13.11 gives us a lower bound 
of I . 

It only remains to show that the resulting element-set incidence matrix A is totally 
unimodular. A matrix A is totally unimodular if and only if every submatrix A' of A has an 
equitable coloring [16]. An equitable coloring of a 0,1 matrix A' is a partition of its columns 
into red and blue columns such that in every row of A' the number of blue l's and red l's 
differs by at most one. Let us construct an equitable coloring for A': all the A-sets are colored 
blue; for each i, if Bi and Oi are present in A' then color one red and the other blue, and 
if only one is present then color it red. Clearly the coloring is equitable; thus, A is totally 
unimodular. ■ 

Appendix B. Finding a threshold value 

The idea is to use parametric search treating A as an unknown which lies in a certain 
range (A/,A r ). Initially A; = and A r = maxjj Residual capacities and dual variables 

are kept as a linear function of A. We maintain the invariant that A^~ separates less than P 
profit and A~ covers more than P profit. Suppose that in the interval (A;, A r ) the algorithm 
agrees on the first i elements. By this we mean that if we run the algorithm with any value 
A € (A;, A r ) the value of the dual variables of these i elements (as a function of A) is always 
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the same. In each iteration we either find a threshold value or we narrow the interval such 
that the algorithm agrees on one more element, while maintaining the invariant. This cannot 
go on forever because the algorithm will eventually behave the same throughout the interval 
and the invariant would be violated. If at some point along the way we find a value of A 
covering exactly P profit we stop as the solution is optimal. For simplicity, from now on we 
assume that this never happens. 



Suppose that kolen agrees on the first i — 1 elements in the interval (A/, A r ). Note that 
the residual costs and i's penalty are linear functions of A. As a result, which set has the 
minimum residual cost, and thus which one becomes tight, if any, varies with A. Our goal is 
to narrow the interval such that the set that becomes tight is always the same, or yi = piX 
within the new interval. If we draw the lines corresponding to the residual costs of set that 
i belongs to and piX, the segments on the lower envelope correspond to the next tight event, 
either a set or element i. See Figure EJ Let X±, . . . ,X S correspond to the intersection points of 
the lower envelope, and let Ao = A; and A s +i = A r . For every < a < s, within the interval 
(A , A a +i) the algorithm agrees on i. Note that either one of the A a is a threshold value, or 
there exists an a such that A+ covers less than P profit and A~ +1 covers more than P profit. 
Given the latter we update A; = A a and A r = A a +i and repeat. 

Theorem B.l. A threshold value can be found by making 0{\U\ log |<S|) calls to kolen. 

Proof. The above discussion outlines the algorithm for finding a threshold value. Regarding 
the time complexity, when searching for the next tight event, instead of trying every A a value, 
we can use binary search. Thus, only log |<S| + 1 calls to kolen are needed to find the right 
value of a to narrow the interval. ■ 
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Figure 2: Narrowing the interval for A. 



